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Abstract 

RNA editing is a post-transcriptional process that can act upon transcripts fronn nnitochondrial, nuclear, and chloroplast genonnes. 
In chloroplasts, single-nucleotide conversions in mRNAs via RNA editing occur at different frequencies across the plant kingdom. 
These range fronn several hundred edited sites in sonne nnosses and ferns to lower frequencies in seed plants and the complete lack of 
RNA editing in the liverwort Marchantia polymorpha. Here, we report the sequence and edited sites of the chloroplast genome from 
theliverwortPe///aend/V//fo//a.Thetypeandfrequencyof chloroplast RNA editing display a pattern highlysimilartothatinseed plants. 
Analyses of the C to U conversions and the genomic context in which the editing sites are embedded provide evidence in favor of the 
hypothesis that chloroplast RNA editing evolved to compensate mutations in the first land plants. 
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Introduction 

Chloroplast genomes descend from the genome of a free- 
living, cyanobacterial-like ancestor, which was engulfed by a 
eukaryotic cell and reduced to a chloroplast (Martin et al. 
2002; Hempel et al. 2007; Bolte et al. 2009). Several genomic 
characteristics of cyanobacteria, such as operon structures, are 
still maintained in chloroplast genomes. However, other attri- 
butes such as various transcript maturation processes or the 
phage-type RNA polymerase are lacking in cyanobacteria and 
hence can be thought to have arisen during plastid evolution 
(Maier et al. 2008; Stern et al. 201 0; Tillich and Krause 201 0). 
This is arguably true for RNA editing in plastids as well, that is, 
the post-transcriptional alteration of one to a few bases in 
maturing mRNAs (Tillich et al. 2006). 

RNA editing was observed in some minicircle-encoded plas- 
tid genes in dinoflagellates (Zauner et al. 2004; Dang and 
Green 2009) but was never observed in plastid-encoded 
RNAs in other algae groups so far. Nevertheless, it is common 
in land plants where it occurs at frequencies that vary across 
lineages. Seed plant plastids typically exhibit 26-54 editing 
sites (Wakasugi et al. 1996; Tillich et al. 2006; Jiang et al. 
2012). By contrast, the fern Adiantum capillus-veneris (Wolf 
et al. 2004) and the hornwort Anthoceros formosae 



(Kugita et al. 2003) harbor hundreds of sites that are post- 
transcriptionally modified by editing. At the high end of the 
spectrum, the lycophyte Isoetes engelmannii appears to have 
more than 1,500 edited sites in total (Grewe et al. 201 1). At 
the low end, the moss Physcomitrella patens maintains chloro- 
plast RNA-editing machinery for two sites only (Miyata and 
Sugita 2004; Rudinger et al. 2009), whereas in the liverwort 
Marchantia polynnorpha, no detectable RNA editing occurs at 
all (Steinhauseret al. 1999). 

What underlies this diversity in RNA-editing frequency 
among plastids? Several hypotheses have been put forth to 
explain the evolution of chloroplast RNA editing in land plants. 
Covello and Gray (1 993) proposed a three-step model for the 
evolution of RNA editing, in which first an RNA-editing activity 
appears in an enzyme, followed by mutation at editable pos- 
itions and genetic drift, and finally fixation of the edited sites 
and their corresponding editing machinery; the general under- 
lying principle of evolving complicated processes, like editing, 
without advantage was later dubbed constructive neutral evo- 
lution (Stoltzfus 1999; Gray et al. 2010). Jobson and Qiu 
(2008) took a more selectionist stance on the issue, arguing 
that RNA editing arose in land plant chloroplasts through nat- 
ural selection imposed by functional constraints on the 
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encoded proteins. Our own suggestion is that chloroplasts 
encountered a high load of mutations during the transition 
from an aquatic to a terrestrial life style and that RNA-editing 
mechanisms suppressed these mutations, thereby eliminating 
nonsynonymous amino acid changes in coding regions and 
compensating for their potentially deleterious effects in the 
chloroplast genome (Tillich et al. 2006; Maier et al. 2008). In 
addition, the incorporation of a phage-type RNA polymerase 
into plastid genome expression could have helped to over- 
come mutations in promoter regions, which are not tran- 
scribed (Maier et al. 2008). 

Regardless of the causes behind the origin of chloroplast 
editing, edited sites can be lost by back mutation (or remuta- 
tion) of the edited site into the "correct" (ancestral) base. 
Lineage-specific variation in this parameter could account 
for the different frequencies of editing across the chloroplast 
of different groups. However, in that case, the bottleneck for 
the elimination of editing sites would entail mutational dy- 
namic that vary across sequence contexts and species, and 
indeed genomic contexts with a low remutation rate (Tillich 
et al. 2006) were found for flowering plants to be especially 
TCA trinucleotides (Morton 1995, 2003; Morton etal. 1997). 
In chloroplast genomes with a high editing load, no strict con- 
sensus with respect to flanking nucleotides can be defined, 
whereas in seed plants with a low editing frequency, a biased 
RNA-editing pattern is present in which the editing site C is 
preferentially located within a TCA genomic context (Tillich 
et al. 2006). 

Marchantia polymorpha figures prominently in this issue, 
because no RNA-editing sites exist in the chloroplast genome 
(Freyer et al. 1997; Steinhauser et al. 1999), indicating that 
chloroplast functions are independent of RNA editing and that 
regulatory mechanisms in other lineages that involve editing 
might have evolved secondarily. Thus, M. polymorpha and the 
whole marchantiid subclade might never have had the "load" 
of RNA editing or might have lost it secondarily. Analysis of 
jungermanniid liverwort plastid genomes could be instructive. 
As chloroplast and mitochondrial RNA editing generally coin- 
cides (Freyer et al. 1997; Steinhauser et al. 1999; Tillich et al. 
2006), the detection of mitochondrial RNA-editing sites in 
some jungermanniid liverworts are possibly a proxy for chloro- 
plast RNA editing in this group (Groth-Malonek et al. 2007; 
Rudinger et al. 2008). In line with that view, RNA editing was 
identified in the chloroplast of the liverworts Bazzania trilobata 
and Pellia epiphylla by analyzing ndhB and rbcL transcripts 
(Freyer et al. 1997), and extensive RNA editing was observed 
in the liverwort Haplomitrium mnioides (Groth-Malonek et al. 
2005, 2007). Haplomitrium (Haplomitriopsida) was shown to 
be, to some extent, an isolated group in the clade of liver- 
worts, which most probably diverged before the separation of 
Jungermanniopsida and Marchantiopsida (Groth-Malonek 
et al. 2007). Taken together, the data suggest that M. poly- 
morpha and the whole marchantiid subclade might have lost 
RNA editing in organelles secondarily. 



To test this "secondary lost" prediction and to illuminate 
the distribution and evolution of RNA-editing sites in the clade 
of liverworts and plants in general, we sequenced the chloro- 
plast genome of the liverwort P. endiviifolia (Dicks.) Dumort 
and determined the frequency and the genomic context of 
chloroplast RNA editing. Our data support the view of a sec- 
ondary loss of RNA editing in M. polymorpha and indicate that 
chloroplast RNA editing was present in early land plants. In 
addition, we show that patterns in the RNA-editing sites in the 
chloroplast of a liverwort are comparable to those in seed 
plants, indicating retention of RNA-editing sites in a genomic 
context known to have a low remutation rate (Morton 1995, 
2003; Morton et al. 1997; Tillich et al. 2006). 

Materials and Methods 

Culture Conditions 

Pellia endiviifolia (Dicks.) Dumort was obtained from the 
Botanical Garden of the Philipps University of Marburg. 
Plants were cultured under natural light/dark cycle conditions 
at 20°C on soil. 

DMA and RNA Isolation 

Total DNA was isolated by the CTAB method (Doyle and Doyle 
1990). For RNA isolation, lOOmg of plant material was col- 
lected, frozen in liquid nitrogen, and ground to a fine powder 
with mortar and pestle. One milliliter of Trizol (Life Technol- 
ogies, Darmstadt, Germany) was added, and grinding was 
continued for 2 min. The mixture was transferred to a micro- 
centrifuge tube and spun for 10 min at 12,000 x g and 4°C. 
Supernatant was transferred into a new tube and incubated at 
room temperature for 5 min, after which 200 |il chloroform 
was added, mixed, and incubated at room temperature again 
for 5 min. After centrifugation at 12,000 x g for 10 min at 
room temperature, the aqueous phase was collected and 
transferred into a new tube. An equal volume of isopropanol 
was added, mixed, and spun for 20 min at 12,000 x g and 
4°C. The pellet was washed once with 70% ethanol, dried, 
and dissolved in 50|il H2O. Before cDNA synthesis, the RNA 
was treated with DNAsel (Roche, Munich, Germany or Fer- 
mentas, St. Leon-Roth, Germany) and reverse transcribed 
using Omniscript Reverse Transciptase, OneStep RT-PCR Kit 
(both Qiagen, Hilden, Germany), or MuLV Reverse Transcrip- 
tase (Fermentas, St. Leon-Rot, Germany). 

PCR and Sequencing 

For genome sequencing. Long PCR Enzyme Mix (Fermentas, 
St. Leon-Rot, Germany) was used. Normal PCR was made with 
Biotools DNA polymerase (Biotools, Madrid, Spain) or 
Phusion® High-Fidelity polymerase (Finnzymes, Vantaa, 
Finland). PCR products were purified with NucleoSpin Gel 
and PCR Clean-up Kit (Macherey-Nagel, Duren, Germany) 
and directly sequenced using the DYEnamic ET Terminator 
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Cycle Sequencing Kit (GE-Healthcare, Munich, Germany) on 
an ABI PRISM® 377 DNA Sequencer (Applied Biosystems, 
Darmstadt, Germany). Oligonucleotides used for PGR and 
sequencing were purchased from MWG (Ebersberg, Ger- 
many) or Sigma-Aldrich (Munich, Germany). Coverage of 
sequencing was at least 2-fold based on independent isola- 
tions. In some cases, we had to clone PGR products in pJet 
Cloning Vector (Fermentas, St. Leon-Rot, Germany). In this 
case, sequencing coverage was at least 6-fold. 

Data Assembly and Annotation 

Data were assembled and edited using the Sequencher 
4.7-5.0 (Gene Codes Corporation, Ann Arbor, Ml). Open 
reading frames or tRNAs were identified using tRNAscan SE 
(Schattner et al. 2005) and ORF Finder and Blast tools from 
National Center for Biotechnology Information. DOGMA 
(Wyman et al. 2004) and OGDraw (Lohse et al. 2007) were 
used for annotation and map drawing, respectively. 

Results and Discussion 

Pellia endiviifolia cpDNA Gene Content and Structure 

The chloroplast genome of P. endiviifolia was amplified from 
genomic DNA and sequenced yielding a contig of 1 20,546 bp 
(fig. 1 and table 1). The sequence was nonpolymorphic except 
for some differing sequences in the 3^-region of the cysT gene. 
That polymorphic stretch might indicate a chromosomal 
region involved in a switch from circular to linear chromo- 
somes as described by others (Bendich 2004; Oldenburg 
and Bendich 2004). We detected 123 genes including YCFs 
and genes for hypothetical proteins, all known from orthologs 
from other chloroplast genomes (fig. 1). In comparison to the 
chloroplast genome of M. polymorpha, the first sequenced 
chloroplast genome of a liverwort, we observed strong 
conservation in arrangement, gene order, and gene content, 
and the general borders between IR, SSC, and LSC are con- 
served as well. The chloroplast genome of P. endiviifolia 
(120,546 bp) is slightly smaller than that of M. polymorpha 
(121,025 bp), whereas the overall A/T content differs clearly 
{M. polymorpha 71.19% and P endiviifolia 64.1 1 %). Size dif- 
ference in comparison to M. polymorpha is due to various 
small deletions and insertions. Inverted repeat regions are 
more compact in P. endiviifolia, whereas at the border be- 
tween IRb and rps12-3', P. endiviifolia has a 301 -bp-long non- 
coding stretch. An obvious difference is observable in the 
coding region of ycf2, which carries ~550bp deletion in 
comparison to M. polymorpha. Smaller deletions are detect- 
able in ycfl -1 and ycfl -2 (o/f464 and orf^ 068 in M. polymor- 
pha). In general, ycf's are not well conserved between 
these two species. We further identified 21 introns in the 
chloroplast genome of P. endiviifolia, most of which are 
known to be present in other nonparasitic liverwort chloro- 
plast genomes. Differences in existing introns in comparison to 



M. polymorpha comprised an additional intron in the ycf3 
gene in P. endiviifolia and an intron close to the end of the 
coding region of cysT. 

Recently, the chloroplast genome of the jungermanniid 
liverwort Ptilidium pulcherrimum was published (Forrest 
et al. 201 1 ). By comparing their results with the then available 
data, the authors proposed a stasis in chloroplast genome 
structure of liverworts. Our data from P. endiviifolia support 
that view because the plastid genome of P. pulcherrimum is 
nearly identical with respect to the described characteristics. 

All members of the "core set" of chloroplast encoded 
genes indicating the minimal set of genes encoded by all 
chloroplast genomes of photosynthetic active organisms 
(Martin et al. 1998; Martin 2003) are present in the P end- 
iviifolia chloroplast genome. 

RNA-Editing Pattern 

In contrast to M. polymorpha, which shows no organellar RNA 
editing, we identified 54 editing sites in the chloroplast of 
P. endiviifolia by comparing mRNAs with genomic sequences. 
Editing events were observed in 26 of 87 protein-coding 
genes (table 2). For ycf 66, we have no indications of transcrip- 
tional activity, therefore no cDNA sequences could be deter- 
mined. The exact function of the protein product of ycf 66 is 
still unknown, but recently a loss of this gene was described 
for ferns where it was lost in at least four independent cases. It 
therefore appears that ycf 66 is, at least in ferns, not essential 
(Gao et al. 201 1). Nevertheless, the ycf 66 reading frame in 
P. endiviifolia is intact, indicating functional constraints. 

For all editing sites, we observed only C to U conversions. 
Two editing events restore a start methionine (ccsAeU2TM 
and rpl23eU2TM), and three were found to be only partially 
edited. Additionally, we detected two silent editing sites with 
no impact on the encoded amino acid. One of latter is edited 
partially (psbJeU60LL) (table 2). Most affected by RNA editing 
is the ndh-gene family with 1 4 editing events, followed by pet 
genes (1 0), genes encoding subunits of the photosystem ipsa/ 
psb) with 9 editing sites and atp with 8 editing events. 

We examined the genomic context in which RNA-editing 
sites occur (table 2). In seed plants, the editing sites converting 
C^U were shown to have a bias toward 5'-T_A-3' flanking 
bases (Tillich et al. 2006), a genomic context known to have a 
lower substitution rate in plastids (Morton 2003). Analysis of 
the flanking bases of the editing sites (fig. 2) reveals that 17 
show a tCa bias (31.5%), whereas in 29 cases (53.7%), we 
could observe a variation of this motif (tCN or NCa). Overall, 
we observed a 5^-pyrimidine at 96% of all edited sites (thy- 
mine in 70.4% and cytosine in 25.9%). The occurrence of a 
5^-pyrimidine was reported to result in lower substitution rate 
(Morton et al. 1997). Thus, 96% of the editing sites in 
P. endiviifolia cpDNA are embedded in a "low re-mutation 
biased" genomic context. 
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Fig. 1. — The chloroplast genome of Pellia end ivii folia. The plastome of the liverwort is displayed in a circular pattern. LSC, large single copy region; SSC, 
small single region; IRa/b, inverted repeat A/B. Color code see legend on the left. Drawing was made with OGDraw (Lohse et al. 2007). 



The editing events in the chloroplast genome of P. end- 
iviifolia lead to many nonsynonymous changes (fig. 3). 
Serine to leucine or serine to phenylalanine changes are 
most frequently observed, followed by proline to leucine 
changes. This is in line with previously observed predominant 
amino acid exchanges as a consequence of RNA editing (Tillich 
et al. 2006). The genomic context of flanking bases for the 
observed editing sites clearly illustrates why we frequently ob- 
serve the amino acid exchanges S^L, S^F, and P^L 
(fig. 3). Serine is encoded by the tCn codon. This codon is 
affected in 28 of 54 editing events (51.9%) in P. endiviifolia 
(table 2) and is known, as described, for a low remutation rate 



(Morton et al. 1 997; Morton 2003). Proline is encoded by cCn, 
but 8 of 13 editing events affecting this codon are editing 
events on cCa and thereby a sequence context of lower remu- 
tation rate as well (Morton et al. 1997; Morton 2003). These 
data indicate that the frequently observed amino acid 
exchanges, as a result of plastid RNA editing, are predomin- 
antly a consequence of a sequence context of known low 
remutation rate in chloroplast DNA. Other mutations of the 
plastid DNA might have occurred, but these resulted either in 
no exchange of the encoded amino acid or a conservative 
(functionally similar) amino acid exchange likely without nega- 
tive effects on protein function. 
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Characteristics of the Chloroplast Genome of Pellia endiviifolia 



Region Begin-End 


Length (bp) 


A/T Content (%) 


LSC 1-82,508 


82,508 


66.05 


SSC 91,601-111,454 


19,854 


66.97 


IRa 82,509-91,600 


9,092 


52.14 


IRb 111,455-120,546 


9,092 


52.14 


Complete genome 


120,546 


64.11 


Coding regions 






Protein 


73,257 


64.33 


tVrRNAs 


11,922 


46.19 


Complete coding regions 


85,179 


61.79 


Ptilidium pulcherrimum^ 


119,007 


66.77 


Marchantia polymorpha^ 


121,025 


71.19 


^Forrest et al. (2011). 
■"Ohyama et al. (1988). 



Evolutionary Implications 

The evolution of RNA editing is inconnpletely understood, and 
in particular, the question "why editing" is still open. A gen- 
eral answer that would hold for all cases and kinds of editing 
nnight not be possible due to various nnanifestations and func- 
tions of RNA editing in different compartnnents and organ- 
isms. Here, we have focused on RNA editing in chloroplasts 
and the hypothesis proposed by Tillich et al. (2006) which, in 
the main, posits the following: chloroplast RNA editing origi- 
nated in the first land plants whose plastids experienced, 
during the transition to land, a high mutation load. 
Mutations effecting coding regions were possibly compen- 
sated via RNA editing, whereas mutations in important regu- 
latory, nontranscribed c/s-acting sequences, the promoter 
regions, possibly necessitate recruitment of a chloroplast- 
directed phage-type RNA polymerase (Maier et al. 2008). 
Although RNA-editing sites could be eliminated via gen- 
omic remutations of the editing site into the "correct" 
(ancestral) base with no need for editing anymore, gains of 
editing sites might contribute to the different rate of RNA 
editing in different species as well. However, recent studies 
indicate that gains are rare (Hayes and Hanson 2008; Tillich 
etal. 2009). 

If the "Tillich" hypothesis is correct, one would predict to 
observe the following in newly analyzed land plant 
chloroplasts. 

i) RNA editing should not be detectable, caused by second- 
ary loss of the need for RNA editing. The mechanism of 
loss should be remutations, which restore the ancetral 
(pre-editing) base. 

ii) If RNA editing is observed, then either 

iia) RNA editing should present at high frequencies 
(approximately >100 editing sites), whereby the predic- 
tion is that no strong genomic context of the editing 
sites (i.e., the 5^- and 3^-flanking base) should be found 



and edits other than C to U transitions might be 
detectable. 

Alternatively: 

iib) in cases where approximately <100 editing sites per 
chloroplast genome (~30 in seed plants) occur, a strong 
tendency for the location of the editing site within TCA 
trinucleotides (with C being the edited base) and/or a gen- 
erally increasing number of 5^-pyrimidines should be 
observed, because these genomic backgrounds were 
shown to be the genomic context with the lowest remuta- 
tion rate (Morton et al. 1 997; Morton 2003). This addition- 
ally implies that only C to U transitions might be present as 
seen in the seed plant examples (Tillich et al. 2006). 

Because every cytosine edited in P. endiviifolia is a thymine in 
M. polymorpha, we can conclude that these positions remu- 
tated and led to a secondary loss of RNA editing in 
M. polymorpha. However, although closely related, not 
every C-T difference between M. polymorpha and P. endivii- 
folia coding regions reflects an actual editing site because only 
sites that are important for protein function seem to undergo 
editing. Editing frequently results in radical amino acid 
changes, an exchange of amino acids with clearly different 
physicochemical properties, which could impact protein func- 
tion (Maier et al. 1996; Jiang et al. 2012) (fig. 3). Such sites 
might require editing or remutations. Every C that is edited to 
U in P. endiviifolia mRNA is a T in M. polymorpha chloroplast 
DNA. Additionally, the flanking bases, as we see them today, 
are the same in many cases or even have a 5^-thymine in 
M. polymorpha {data not shown). Thus, chloroplast genomes 
of marchantiid liverworts might have overcome the low mu- 
tation bottleneck in the respective genomic context and now 
encode the "right" base. Therefore, it is likely that the chloro- 
plast genome of the marchantiid liverwort progenitor under- 
went a period of accelerated evolution with an increased 
substitution rate. Accelerated evolution and a corresponding 
low RNA-editing frequency have been shown for Geraniaceae 
(Parkinson et al. 2005). Such a "progressed evolution" might 
additionally be seen in the comparatively high A/T content of 
the chloroplast genome of M. polymorpha (table 1). Vice 
versa, a high editing frequency in correlation with a high 
G/C content was shown for the chloroplast genome of the 
lycophyte Selaginella (Smith 2009), congruent with our 
postulates. 

Another possibility is that RNA editing evolved after the 
diversification of liverworts. Available data for RNA editing in 
the mitochondrion genome of other liverworts and from two 
identified chloroplast RNA-editing sites in B. trilobata and 
P. epiphylla (Freyer et al. 1997) suggest that liverworts in- 
herited an RNA-editing load but secondarily lost it in the 
case of marchantiopsida (Groth-Malonek et al. 2007; 
Rudinger et al. 2008). This view is supported by our finding 
of "only" 54 editing sites in the chloroplast of the liverwort 
P. endiviifolia. Thus, it is possible that all groups of land plants 
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Table 2 

All Identified RNA-Editing Sites in the Chloroplast Genome of Pellia endiviifolia 



Gene 


Editing Sites 


Nt Position 


AA Position 


Codon 


Context 




Code 


Remarl<s 


atpB 


1 


1097 


366 


tCg 




S^L 


atpBeU1097SL 




atpF 


1 


113 


38 


cCa 




P^L 


atpFeU113PL 




atpH 


2 


188 


63 


tCa 




S^L 


atpHeU188SL 








224 


75 


cCg 




P^L 


atpHeU224PL 




atpl 


4 


292 


98 


Ctt 


tett 


L^F 


atpleU292LF 








431 


144 


cCt 




P^L 


atpleU431PL 








632 


211 


cCa 




P^L 


atpleU632PL 








704 


235 


cCa 




P^L 


atpleU704PL 




ccsA 


3 


2 


1 


aCg 




T^M 


ccsAeU2TM 


Start codon 






403 


135 


Cgg 


cCgg 




ccsAeU403RW 








482 


161 


cCt 




P^L 


ccsAeU482PL 




cIpP 


3 


81 


27 


etc 


ctet 


L 


clpPeU81LL 


Silent 






320 


107 


tea 




S^L 


clpPeU320SL 








533 


178 


tct 




S^F 


clpPeU533SF 




ndhB 


1 


1427 


476 


tct 




S^F 


ndhBeU1427SF 




ndhD 


5 


536 


179 


tCa 




S^L 


ndhDeU536SL 








548 


183 


tCa 




S^L 


ndhDeU548SL 








824 


275 


cCa 




P^L 


ndhDeU824PL 








1037 


346 


tct 




S^F 


ndhDe1037SF 








1324 


442 


Cat 


teat 


H^Y 


ndhDeU1324HY 




ndhE 


1 


242 


81 


tea 




S^L 


ndhEeU242SL 




ndhF 


5 


566 


187 


tCa 




S-L 


ndhFeU566SL 


Partial 






581 


194 


tea 




S-L 


ndhFeU581SL 








1067 


356 


tCa 




S^L 


ndhFeU1067SL 








2072 


691 


tct 




S^F 


ndhFeU2072SF 








2078 


693 


cCa 




P^L 


ndhFeU2078PL 




ndhG 


1 


133 


45 


Cgt 


tCgt 


R^e 


ndhGeU133RC 




ndhK 


1 


56 


19 


tea 




S^L 


ndhKeU56SL 




petA 


3 


308 


103 


cCa 




P^L 


petAeU308PL 


Partial 






311 


104 


tea 




S^L 


petAeU311SL 








748 


250 


Ccg 


tCcg 


P^L 


petAeU748PL 




petB 


3 


167 


56 


tec 




S^F 


petBeU167SF 








506 


169 


tea 




S^L 


petBeU506SL 








593 


198 


tet 




S^F 


petBeU593SF 




petG 


1 


80 


27 


teg 




S^L 


petGeU80SL 




petL 


2 


23 


8 


tet 




S^F 


petLeU23SF 








74 


25 


tea 




S^L 


petLeU74SL 




petN 


1 


86 


29 


teg 




S^L 


petNeU86SL 




psaB 


3 


764 


255 


cea 




P^L 


psaBeU764PL 








782 


261 


tet 




S^F 


psaBeU782SF 








1595 


532 


cea 




P^L 


psaBeU1595PL 




psbB 


2 


448 


150 


Cgt 


tCgt 


R^e 


psbBeU448Re 








740 


247 


tec 




S^F 


psbBeU740SF 




psbC 


1 


859 


287 


Cgt 


tCgt 


S^L 


psbeeU859Re 




psbJ 


1 


60 


20 


cte 


cteg 


L 


psbJeU60LL 


Silent, partial 


psbM 


2 


41 


14 


tet 




S^F 


psbMeU41SF 








47 


16 


tea 




S^L 


psbMeU47SL 




rpl23 


2 


2 


1 


aCg 




T^M 


rpl23eU2TM 


Start codon 






127 


43 


ett 


tett 


L^F 


rpl23eU127LF 




rpoB 


3 


863 


288 


tea 




S^L 


rpoBeU863SL 








1414 


472 


eat 


teat 


Y 


rpoBeU1414HY 








1940 


647 


tea 




S^L 


rpoBeU1940SL 




rpoC2 


1 


290 


97 


cet 




P^L 


rpoe2eU290PL 




yds 


1 


65 


22 


ceg 




P^L 


ycf3eU65PL 





54 



1354 GenomeBiol. Evol. 4(1 2): 1349-1 357. doi:10.1093/gbe/evs1 14 Advance Access publication Decennber4, 2012 



Chloroplast Genome of Pellia endiviifolia 



GBE 





8 



10 



12 



14 



16 



18 



20 



Fig. 3.— Amino acid exchanges resulting from plastid RNA editing. Tlie types (y axis) and quantities (x axis) of all amino acid exchanges as a result of 
plastid RNA editing in Pellia endiviifolia are shown. Editing leads to amino acid exchanges, which should have an impact on protein function because affected 
amino acids have differing biochemical properties or restore a start codon. 



might need RNA editing in the chloroplast, and only rarely 
they lose it secondarily. 

In comparison to the hornwort A. formosae or the fern 
A. capillus-veneris, a small number of editing sites is present 
in the chloroplast genome of P. endiviifolia, which is compar- 
able to the situation in seed plants. The genomic context of 



the 54 editing sites in P. endiviifolia (fig. 2) show that 1 7 pos- 
itions (31 .5%) have the edited C embedded in a TCA context. 
In addition, 21 editing sites show the edited C in a genomic 
context of TCN (38.9%) and 8 in NCA (14.8%). Only eight 
editing sites (14.8%) do not match TCN or NCA. Of the latter, 
two editing sites cure a start codon. Together with the finding 
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that in P. endiviifolia chloroplast RNA editing involves only C to 
U transitions, this supports the idea that RNA editing evolved 
in the first land plants (Maier et al. 2008). 

The chloroplast genomes of M. polymorpha (Ohyama et al. 
1 988), P. pulcherrimum (Forrest et al. 201 1 ), and P endiviifolia 
are highly conserved in their structure and gene order. Forrest 
et al. (2011) suggested "evolutionary stasis" for chloroplast 
genome structure in liverworts. However, this might be a sec- 
ondary effect, because the low editing rates or even the loss of 
RNA editing in Jungermanniopsida and Marchantiopsida, re- 
spectively, imply the contrary, namely rapid evolutionary rates 
in liverwort chloroplast genomes after splitting of the 
Haplomitrium subclade. 

The present data are consistent with the hypothesis that 
A//, polymorplia and the marchantiid liverworts secondarily lost 
the need for RNA editing in the chloroplast. This implies that 
such "primitive" land plants show a highly derived expression 
machinery in the chloroplast, one in which the mutational 
burden incurred during the transition from water to land 
was suppressed. In addition, the type of RNA editing (C to 
U), the genomic context of the editing site (TCA), and the 
amount of RNA-editing sites detected in the chloroplast 
genome of P. endiviifolia are consistent with the view that 
RNA editing initially evolved to eliminate deleterious mutations 
introduced into the first land plant plastid genomes. 
Subsequent gains of edited sites might have also contributed 
to the species-specific distribution of chloroplast RNA editing, 
but at present this contribution seems to be minor. 
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